Predicting Word Association Strengths
Abstract
This paper examines the task of predicting word association strengths across three datasets: WordNet Evocation (Boyd-Graber et al., 2006), the University of South Florida Free Association Norms (Nelson et al., 2004), and the Edinburgh Associative Thesaurus (Kiss et al., 1973). We achieve results of r = 0.357 and ρ = 0.379, r = 0.344 and ρ = 0.300, and r = 0.292 and ρ = 0.363, respectively, where r is the Pearson correlation and ρ is the Spearman rank correlation. We find Word2Vec (Mikolov et al., 2013) and GloVe (Pennington et al., 2014) cosine similarities, as well as vector offsets, to be the highest-performing features. Furthermore, we examine the usefulness of Gaussian embeddings (Vilnis and McCallum, 2014) for predicting word association strength, making this the first work to do so.
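The features and metrics named in the abstract can be illustrated with a small sketch. It is not the paper's pipeline: the two-dimensional vectors and the association scores below are made-up toy values, and the function names are hypothetical. It only shows how a cosine-similarity feature and a vector-offset feature are computed from a pair of embeddings, and how a prediction would be scored against human ratings with Pearson's r and Spearman's ρ.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def vector_offset(u, v):
    """Element-wise difference v - u, usable as a feature vector for a word pair."""
    return [b - a for a, b in zip(u, v)]


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks, with tied values sharing the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(xs, ys):
    """Spearman rank correlation: Pearson's r computed on the ranks."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


# Toy embeddings and toy association strengths (illustrative values only).
toy_vectors = {"dog": [1.0, 0.0], "cat": [0.8, 0.6], "car": [0.0, 1.0]}
toy_pairs = [("dog", "cat", 0.9), ("cat", "car", 0.4), ("dog", "car", 0.1)]

predicted = [cosine_similarity(toy_vectors[a], toy_vectors[b]) for a, b, _ in toy_pairs]
gold = [score for _, _, score in toy_pairs]

print("Pearson r:", pearson_r(predicted, gold))
print("Spearman rho:", spearman_rho(predicted, gold))
print("Offset dog -> cat:", vector_offset(toy_vectors["dog"], toy_vectors["cat"]))
```

In practice the vectors would come from pretrained Word2Vec or GloVe models, and the cosine and offset features would typically feed a regression model rather than being scored directly.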
Similar resources
Experiential, Distributional and Dependency-based Word Embeddings have Complementary Roles in Decoding Brain Activity
We evaluate 8 different word embedding models on their usefulness for predicting the neural activation patterns associated with concrete nouns. The models we consider include an experiential model, based on crowd-sourced association data, several popular neural and distributional models, and a model that reflects the syntactic context of words (based on dependency parses). Our goal is to assess...
From Predicting Predominant Senses to Local Context for Word Sense Disambiguation
Recent work on automatically predicting the predominant sense of a word has proven to be promising (McCarthy et al., 2004). It can be applied (as a first sense heuristic) to Word Sense Disambiguation (WSD) tasks, without needing expensive hand-annotated data sets. Due to the big skew in the sense distribution of many words (Yarowsky and Florian, 2002), the First Sense heuristic for WSD is often...
Improve Parsing Performance by Self-Learning
There are many methods to improve performances of statistical parsers. Among them, resolving structural ambiguities is a major task. In our approach, the parser produces a set of n-best trees based on a feature-extended PCFG grammar and then selects the best tree structure based on association strengths of dependency word-pairs. However, there is no sufficiently large Treebank producing reliabl...
Exploring the Relationship between Semantic Spaces and Semantic Relations
This study examines the relationship between two kinds of semantic spaces — i.e., spaces based on term frequency (tf) and word cooccurrence frequency (co) — and four semantic relations — i.e., synonymy, coordination, superordination, and collocation — by comparing, for each semantic relation, the performance of two semantic spaces in predicting word association. The simulation experiment demons...
The role of resilience, positive/negative emotions, and character strengths in predicting burnout of military personnel
Background: Military personnel are at high risk for burnout due to exposure to high job stress. The purpose of this study was to investigate the role of character strengths, positive and negative emotions, and resilience in predicting burnout of military personnel in Iran. Materials and methods: A sample of 146 people working in different military and law enforcement forces was selected by ava...